Random road networks: the quadtree model 
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Abstract 

What does a typical road network look like? Existing genera- 
tive models tend to focus on one aspect to the exclusion of others. 
We introduce the general-purpose quadtree model and analyze its 
shortest paths and maximum flow. 

1 Introduction 

Many optimization problems naturally feature a road net- 
work. Computing driving directions can be viewed as a 
shortest-path problem, and at a higher level, a typical vehi- 
cle routing problem is to schedule a fleet of capacity-limited 
trucks making deliveries from a depot to several customers. 
Algorithms that solve these kinds of problems on general 
graphs are often impractical, leading algorithm designers to 
consider the special structure of road networks. How can we 
characterize this structure? 

Viewing the US road network as a metric space, Bast, 
Funke, and Matijevic found a small set of transit nodes that 
covers almost all of the shortest paths [3] . The existence of 
such a set underlies their point-to-point shortest-paths al- 
gorithm and was later formalized by Abraham, Fiat, Gold- 
berg, and Werneck as highway dimension in order to explain 
the performance of several competing algorithms ^1 . Kala- 
pala, Sanwalani, Clauset, and Moore examined several na- 
tional road networks and discovered evidence of scale invari- 
ance |14| . Aldous has explored the axiomatic consequences 
of scale invariance in road networks [5]. Masucci, Smith, 
Crooks, and Batty studied the London road network, find- 
ing, among other things, that the length of a shortest path is 
usually competitive with the distance between its endpoints 
and that most pairs of points are connected by a path with 
few turns [16] . 

Topologically, road networks have few crossings involv- 
ing roads at different elevations, leading many researchers, 
including Masucci et al., to view road networks as planar 
graphs. Eppstein and Goodrich, however, dispute both this 
characterization and a narrower one that excludes highways, 
citing evidence that in the US road network, crossings be- 
tween local roads number on the order of ^/n, where n is the 
number of intersections [8]. They introduce as an alterna- 
tive the notion of multiscale-dispersed graphs. Algorithmi- 
cally, one key property of planar graphs [TJ] and multiscale- 
dispersed graphs [5] is that they have separators of size 
0{y/n), whose removal leaves two subgraphs of approxi- 
mately the same size disconnected from one another. Fred- 
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erickson was the first of many to design separator-exploiting 
graph algorithms [llj . Other properties exclusive to planar 
graphs are exploited by algorithms such as the O(nlogn) 
maximum sf-flow algorithm due to Borradaile and Klein [5] , 
which can be used for large-scale empirical experiments with 
the quadtree model. 

Despite the role of road networks in practical optimiza- 
tion, there have been few attempts to synthesize these prop- 
erties into a generative model. Ideally, such a road network 
model would be realistic enough to yield new insights, yet 
mathematically tractable so as to be useful in validating new 
specialized algorithms. Motivated by the success of genera- 
tive models in the study of social networks [17j , we introduce 
a new model of random road networks based on quadtrees. 

1.1 Existing models 

One natural proposal is the uniform random planar 
graph. It is well studied and has efficient sampling algo- 
rithms [m [T5] , but it has more high-degree nodes than real 
road networks (TB] and does not come with a geometric em- 
bedding. Gerke, Schlatter, Steger, and Taraz analyze an 
incremental random planar graph process where uniform 
random edges that do not complete a forbidden subgraph 
are added to an initially empty graph |13j . It has the same 
drawbacks, and a variant where straight edges are added be- 
tween nodes chosen uniformly at random in the unit square 
is considered and rejected by Masucci et al. as being too 
difficult to navigate [TB]. 

Abraham et al. describe an incremental road network con- 
struction algorithm that takes as input a sequence of points 
in a metric space pj . Sets of input points Co 3 Ci 3 • • • are 
maintained in such a way that Ci is a maximal set of points 
at distance greater than 2' from one another. For all i, roads 
are added between points p,q € Ci with dist (p, q) < 6 • 2\ 
For all p e Ci-i — Ci, a. road is also added from p to the 
closest point in Ci. In essence, this algorithm creates local 
road networks at different scales, together with connections 
between the scales. It is introduced as motivation for the 
authors' hypothesis that real road networks have small high- 
way dimension and is not intended to capture the topology 
of a road network. 

Kalapala et al. introduce a simple model of fractal road 
networks in order to study scale invariance jl4j . The 
quadtree model differs from this model in using a random 
tree whose distribution is controlled by a sprawl parameter, 
instead of a tree complete to a specified depth. Visually, the 
road networks produced by the model of Kalapala et al. are 
too uniform to resemble real road networks. 
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Masucci et al. propose the Growing Random Planar 
Graph (GRPG) model, which is based on their statistical 
analysis of the London road network [16]. The GRPG model 
starts with a single vertex and constructs a planar-embedded 
tree incrementally. At each step, a random parent vertex is 
chosen along with a random offset, and a child is placed off- 
set from the parent as long as the new edge does not cross an 
existing edge. Short edges are added to obtain the finished 
product. This model seems difficult to analyze formally in 
light of its brute-force approach to ensuring that the graph 
is planar, and its creators make no attempt to do so. 

1.2 The quadtree model 

The quadtree model has two parameters: n, the number 
of intersections, and r, which controls the amount of sprawl. 
Imagine a square tract delimited by roads, to which n people 
immigrate in succession. Each new arrival chooses an exist- 
ing square at random, builds two road segments bisecting it 
horizontally and vertically, and settles at the newly-created 
intersection. This intersection is at the corner of four new 
delimited squares. If people want to be far from other peo- 
ple (r < 1), then they prefer to settle larger squares. Con- 
versely, if they want to be close (r > 1), then they prefer 
smaller. Figure [T] shows the effects of different preferences. 
Figure [2] presents instances of the quadtree model and the 
GRPG model side by side with a real road network. 

Road networks generated by the quadtree model have sev- 
eral desirable features. They are embedded in the plane, 
self-similar, and for some choices of r, they display real- 
istic variations in density. Between any two points there 
exists a hierarchically organized path whose length is within 
a constant factor of their Euclidean distance and that with 
high probability when r < 1 makes O(logn) turns (Theo- 
rem [ij Section |4]) . This conforms with the observation of 
Masucci et al. that London is efficiently navigable in both 
senses [TB]. Unlike the GRPG model, the quadtree model is 
simple and avoids an unrealistic abundance of acute-angled 
intersections. Although the quadtree model does not specify 
speeds or capacities for roads, the technique of Kalapala et 
al. can be used to assign both based on the depth of each 
road in the tree [13]. Different subdivision patterns can be 
used to avoid the monotony of square lots; we anticipate that 
our techniques will still apply with straightforward modifi- 
cations. 

1.3 Maximum flow on road networks 

The capacity of road networks has received far less atten- 
tion than shortest paths, yet for urban planners who must 
manage rush hour traffic and the possibility of natural dis- 
asters, it is a very important quantity. Imagine that a large 
number of drivers wish to travel from the west edge of the 
road network to the east: how does sprawl affect the rate at 
which they succeed? 

For simplicity, we assume that each road has unit capacity. 
We anticipate that our techniques extend straightforwardly 
to more realistic settings where low-level roads have less ca- 
pacity. We measure the capacity of a road network by the 



maximum value of an east-west flow, which by our assump- 
tion equals the maximum number of edge-disjoint east-west 
paths. By the max-flow min-cut theorem [7l[T0], the capac- 
ity of a road network is also equal to the minimum total 
capacity of roads that when removed, disconnect the east 
from the west. 

We obtain a partial characterization of maximum flow in 
the quadtree model by analyzing the capacity of a random 
vertical cut for upper bounds and by embedding a partial 
grid for lower bounds. When r < 1, the road network 
asymptotically resembles a grid, with maximum flow on the 
order of -Jn in expectation (Theorem [2j Section [6|. When 
r > 1, the squares nest very deeply and the maximum flow 
is constant in expectation (Theorem |4j Section [8| . These 
results are proved by analyzing a continuous-time variant 
of the quadtree process with exponential random variables. 
When r = 1, we have been able to prove only that a lower 
bound is S7(n°'°®^ ") and an upper bound is 0{'n}/^)^ con- 
firming the existence of a phase transition (Theorem [sj Sec- 
tion [t]) . The proofs of both bounds involve a generating 
function. Experimentally, the true exponent lies somewhere 
between the two, but its exact identity is unknown. (Note 
that the quadtree in this case is uniform random.) 

Previously, Eppstein, Goodrich, and Trott studied the ex- 
pected number of roads intersected by a random line, both 
theoretically (for multiscale-dispersed graphs) and experi- 
mentally [S] . They proved for arbitrary multiscale-dispersed 
graphs that this number is 0{^Jri) but found that for real 
road networks it is significantly smaller. The same number 
for the quadtree model with r = 1 is Q{'n}^^)^ which to- 
gether with visual evidence, suggests that r = 1 — e may be 
a realistic choice. 

2 Formal model description and 
summary of results 

Let S = {Nw^ Ne, Sw, Se} be an alphabet on four sym- 
bols. We view strings as nodes in an infinite quadtree rooted 
at the empty string A, where the children of each node 
w € S* are wa for all a € E. In turn, we view these nodes 
as squares. Starting with the unit square A, we cut each 
square horizontally and vertically to obtain its four children, 
as shown in Figure [3} Each square u; S S* so obtained is 
axis-aligned and has side length 2~l"'l. 

Let r S (0, oo) be a parameter. We specify a random grow- 
ing road network as a sequence of sets of squares Leaves„. 
Initially, there is only the unit square. 

Leaveso — {A}. 



For all integers n > 0, we choose a random square Split„ e 
LeaveSn to subdivide. When r < 1, larger squares are more 
likely to be chosen. When r > 1, smaller squares are more 
likely. When r = 1, the choice is uniform. The distribution 
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Figure 1: the quadtree model. Left: r 
(small-square preference) . 
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3/4 (large-square preference). Middle: r = 1 (neutral). Right: r — 5/4 
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Figure 3: Splitting square w. 
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of Split„ is given by 



For all times t £ [0,oo), let 



Vw e LeaveSn, 

Pr(Split„ = V I LeaveSn) = 



-ujfELcavcsTi 



r\w\ ■ 



To obtain the next set of squares, we replace Split„ by its 
children. 

Leaves„+i = (Leaves„ — {Split„}) U {Split„a : a S S}. 

We call this model the quadtree model for random road net- 
works. In the terminology of Bertoin |4j 1.1], it is a con- 
servative fragmentation chain with index of self-similarity 
a - log4(l/r). 

For all integers n > 0, let Gn be the graph of the square 
subdivision corresponding to Leaves„. Let Left„ be the set 
of vertices with a; = and Right„ be the set of vertices with 
X = 1. Define MaxFlow„ to be the maximum size of a set 
of pairwise edge-disjoint paths in G„ from vertices in Left„ 
to vertices in Right„, that is, the maximum flow in G„ from 
Left„ to Right„ assuming unit-capacity edges. 

Theorem 1. Let w^x £ Leaves„ he squares, let p be a point 
on the boundary of w, and let q be a point on the bound- 
ary of X. Define v to be the least common ancestor of w 
and X. There exists a path in G„ from p to q that changes 
direction at most 3((i^ + rfi) + 6 times and has length at 
most 2disti(p,g), where dw,dx are the depths ofw,x in the 
subtree rooted at v. 



Theorem 2. If r < 1, then 



< E[MaxFlow„] < 0{n^'^). 

The lower bound for r < 1 is very close to the upper 
bound; in this case, for all e > 0, we have E[MaxFlow„] ~ 
f](ni/2-^). 

Theorem 3. If r ^ 1, then 

r2(n°-"^^-) < E[MaxFlow„] < 0{n^/^). 
Theorem 4. If r > 1, then E[MaxFlow„] = 9(1). 

3 Continuous-time reformulation 

When r 7^ 1, it is convenient to work with a continuous- 
time variant of the quadtree process. In switching to contin- 
uous time, we lose fine control over the number of intersec- 
tions but gain independence between disjoint subtrees. The 
former is not an issue for us. 

For all squares w S S*, let Lifespan(w) be an independent 
exponential random variable with rate r'™'. Define 



Birth(?i;) 



E 



Lifespan(w) 



t; is a proper ancestor of w 



Death(w) = Lifespan(ti) 

V is an ancestor of w 

= Birth(u') + Lifespan(?i)). 



Alive(t) = {w : Birth(w) < t < Death(u;)}. 

For all integers n > 0, let 

T{n) ^ M{t : #Alive(<) = 3n + 1}. 

We assume, as is the case with probability 1, that T is ev- 
erywhere defined. The following proposition can be proved 
inductively by coupling the discrete process with the con- 
tinuous process. 

Proposition 5. For all integers n > 0, the random sets 
Leaves„ and Alive(r(n)) are identically distributed. 

In the proofs that follow, we assume that Leaves„ = 
Alive(T(n)). 

4 Navigation (Proof of Theorem 1) 

The key to short paths with few turns is using the hier- 
archical organization of the road network. In essence, the 
proofs in this section work by replacing squares on a path in 
the quadtree with pieces of their boundaries (and possibly 
their siblings'). Note that when r < 1, with high probability 
the quadtree has height O(logn). 

Lemma 6. Let w € Leaves„ be a square. Let p be a point 
on the boundary of w and let v be an ancestor of w. Define q 
to be the unique point on the boundary of v that is due north 
(respectively, west, east, south) of p. There exists a path in 
Gn from p to q that changes direction at most 3d -|- 2 times 
and has length at most 2disti(p, g), where d is the depth of 
w in the subtree rooted at v. 

Proof. (See Figure|4]) We induct on d. If d = 0, then w = v, 
and p is on the boundary of v. Take the shortest path on 
the boundary of v. 

If d > 0, then let a S S be the unique symbol such that 
v' = va is an ancestor of w. Define q' to be the unique 
point on the boundary of f ' that is due north of p. By 
the inductive hypothesis, there exists a path in G„ from 
p to q' that changes direction at most 3d — 1 times and 
has length at most 2disti(p, g'). If a € {Nw, Ne}, then 
q' = q, and no more travel is required. Otherwise, from 
q' , we change directions at most 3 more times in taking 
the shortest path to q on the boundary of vNw (if a = 
Sw) or vNe (if a = Se)- The total path length is at most 
2 disti(p, 9') -I- 2 disti(g', g) = 2disti(p,g), since q' is on the 
line segment from p to q. □ 

Proof of Theorem^ Recall that w,x G Leaves„ are squares 
whose least common ancestor is v. The depths oi w,x in 
the subtree rooted at v are du,,dx. The point of departure 
p lies on the boundary of w, and the point of arrival q lies 
on the boundary of x. Our goal is to find a path from p to 
q that changes direction at most 3(c?u, + d^) +6 times and 
has length at most 2 disti(p, g). 
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Figure 4: Left: Lemma [g] with w = vSw^w and p as shown. Right: Theorem [l] with w ^ vSw^w and x ^ vNeSw and 
p,q as shown. 



lfv = w — x, then take the shortest path on the boundary 
of V. Otherwise, w,x =/= v, since no square in Leaves„ has a 
proper ancestor in Leaves„. Assume by rotational symmetry 
that p is on or below the horizontal bisector of v and that 
q is on or above the bisector. Let g' be the unique point on 
the bisector due north of p and let p' be the unique point on 
the bisector due south of q. Apply Lemma[6]to obtain paths 
from p to g' and from p' to q. Join these paths with a seg- 
ment from g' to p' to obtain a path that changes directions 
at most (3d^-|-2)-|-2-|-(3c?2: + 2) times and has length at most 
2disti(p,g') + disti(g',p') + 2disti(p',g) < 2 disti(p, g). □ 

5 Maximum flow 

To bound MaxFlow„ below, we look to embed a regular 
grid in G„. When r = 1, it is advantageous to consider an 
incomplete grid, with holes. To bound MaxFlow„ above, we 
invoke the max-flow min-cut theorem and look for a vertical 
cut with low capacity. 

For all integers c? > 0, let 

Holes„(d) ^ {w : w £ Leaves„, < d} 

be the set of holes at depth d, that is, the set of squares that 
obstruct the embedding of a 2"^ x 2"^ regular grid by having 
height larger than 2^''. 

Lemma 7. For all integers n,d > 0, 

MaxFlow„ > 2^^ + 1 - ^ (2''-l"'l - 1), 

Proof. (See Figure [sj) We attempt to send 2*^-1-1 units of 
flow horizontally. Each square w € Holes„((i) may block up 
to 2'*-l'"l - 1 units. □ 



Lemma 8. For all integers n > 0, 



MaxFlow„ < 1 



E 2- 



\w\ 



Proof. By the max-flow min-cut theorem [7| IIP) , every 
Left„-Right„ cut in the graph G„ has capacity at least 
MaxFloWri- Choose U e (0, 1) — Q uniformly at random 



and form an Left„-Right,j cut by partitioning the vertices 
into those with x < U and those with x > U (all vertices 
have rational coordinates) . The capacity of this cut is equal 
to the number of squares intersected by the line x — U , plus 
one. Each square w g Leaves„ is intersected with probabil- 
ity equal to its width 2^1™', and the conclusion follows by 
an averaging argument. □ 

Lemma |8] leads to an upper bound on MaxFlow„ that is 
tight when Gn is a regular grid. 

Lemma 9. For all integers n>Q, 

MaxFlow„ < (3n + l)^/^ ^ i 

Proof. By Lemma[8] it suffices to obtain an upper bound on 
the total square width plus one, which can be accomplished 
via the Cauchy-Schwarz inequality. Let {wi, . . . , w^n+i} — 
LeaveSn and consider the (3n + l)-dimensional vector x 
where x^, = 2~l"''=l. Denoting by 1 the all-ones vector of 
the same dimension. 



2-l'"l = 1 - x < nil 



2 l|X||2 



by Cauchy-Schwarz 
il||2 = (3n + 1)1/2 



X 2 



□ 



^ 4-H)''' = l 
since 4"'™' is the area of ' 



6 Maximum flow when r < 1 (Proof 



of Theorem 



2) 



Given that large squares are much more likely to be split 
than small squares, it is probable that all square sizes lie in a 
narrow interval, in which case we can embed a relatively fine 
regular grid in Gn. In fact, for any time t, we can say with 
some confidence what the sizes of the squares in Alive(t) will 
be. Lemma [TO] bounds the probability that a small square 
is born early. 
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Figure 5: Left: the asymptotic behavior of E[MaxFlow„] in n for fixed r. Right: Lemma [t] with d = 3. Square NwSe 
blocks 2^^^ — 1 = 1 units of flow, and square Se blocks 2^^^ — 1 = 3. 



Lemma 10. Assume that r < I. For all squares w G S* 
and all integers 5 G [0, 

Pr(Birth(w) < r"*") < rC^i"'). 
Proof. If Birth(i(;) < r^^, then for all proper ancestors v of 



w, it holds that Lifespan(z;) < 



As Lifespan(f) is ex- 



ponential with rate r'"', we have Pr(Lifespan(w) < r = 
1 — exp(— r'^l^'') < min{rl^l^'', 1}. These events are inde- 
pendent, so we multiply their probabilities and obtain the 
stated bound. □ 

Lemma [TT] bounds the probability that a large square dies 
late. 

Lemma 11. Assume that r < I. For all squares w € E* 
and all p £ (0, 1], 

Pr(Death(u;) > E[Death(w)] ln(l/p)) < + l)p. 

Proof. If Death(M;) > E[Death(w)] ln(l/p), then there 
exists an ancestor v oi w such that Lifespan(w) > 
E[Lifespan(z))] ln(l/p). For all v, the probability of the latter 
is p, and w has + 1 ancestors. □ 



Lemma 12 establishes that with probability at least 1/2, 
the last square at depth c—0{y/c) dies before the first square 
at depth c is born. 

Lemma 12. For all r e (0,1), there exist constants co,a 
such that for all integers c > cq, with probability at least 1/2, 



V-y e E", w e E^ 
where a = c — \_a^/c\ . 



Death(w) < Birth(w;). 



Proof. We exhibit a relatively late time {r~^) at which with 
probability at least 3/4, no square at depth c has been born. 
We then choose a so that with probability at least 3/4, every 
square at depth a has been born. More details are in the 
appendix. □ 



We now relate these results to MaxFlow„. 

Proof of Theorem [1[ Lemma [9] gives the upper bound 
E[MaxFlow„] = 0{n^^^). By an averaging argument, there 
exists a square w £ Leaves^ with area at most l/(3n + 1). 
It thus holds that \w\ > c = [log4(3n -1-1)]. Defining 
d = c — [ay/c\ + 1, where the constant a comes from 



Lemma 12 it holds with probability at least 1/2 that 
V e LeaveSn, \v\ < d} = 



Holes„(c?) = {v 
Lemma [7] and obtain the lower bound 



We apply 



□ E[MaxFlow„] > 



1 



,1/2 



expO(vlnn) 



7 Maximum flow when r = 1 (Proof 



of Theorem 



3) 



For all integers 7i > 0, define an ordinary generating func- 
tion 



When r — 1, there is a nice closed form for E[F„(i:)]. This 
is helpful because using F„, we can approximate from below 
the lower bound of Lemma [7] and express exactly the upper 
bound of Lemma [51 

Lemma 13. Assume that r — 1. For all integers n > and 
all t, 



ri-l 



1 



fc=0 



Proof. For all integers fc > 0, 
E[i^fc+i(<) I Leavesfc] = 



1 



At - 1 
3fc + l 



fc=0 



Fk{t). 



3k + 1 



Since the set Leaves^ determines Fk (t) , we can condition on 
Fk{t) alone. The conclusion follows by induction and the 
inequality 1 -I- w < exp u. □ 



6 



Proof of Theorem The upper bound is obtained by writ- 
ing the total square width in terms of _F!„. By Lemma [Sj 

MaxFlow„ < 1 + 2"l"'l ^ F„(l/2) + 1. 



By Lemma 13 and integrating, 

n-l 

E[MaxFlow„] < 1 + exp ^ 

fc=0 



1 



3fc + l 



<exp(l+i^^i^)+l = 0(nV3). 



The lower bound is more involved. To some extent, we em- 
ulate the proof of Devroye of the upper bound on the height 
of a random binary tree [6] . Let c, d, t be defined by 



cln(3c/2) - c + 1/3 = l/ln(3n + 1) 
(c = 0.124 ■• 
d= Lcln(3n + l)J 
t = 3c/ A. 

We have 

MaxFlow„ 



0(l/lnn)) 



>2'^ + l- J2 (2''"l'^l-l) 

itiGHolcsTi (d) 

by Lemma [7] 

since 1/t > 2 and Vw G Holes„(c?), \w\ < d — 1 



> 2 



tx;^ Leaves 

since HoleSn(c?) C Leaves^ and t > 

2'' -r'^Fr.it). 



Take expectations, use Lemma 13 integrate, and substitute 
c, d, t (see the appendix). □ 

8 Maximum flow when r > 1 (Proof 



of Theorem 



4) 



Any square born after infinitely many other squares never 
enters the tree. When r > 1, there exists with probability 1 
a least time U £ [0, oo) for which there exist infinitely many 
squares w e E* with Birth(w) < U . This effect is known 
as the explosion of Markov chains in continuous time. At 
any given depth, most of the squares are born after time ?7, 
so the total width of all squares that ever enter the tree is 
small. 

How can we bound U above? For all squares w G S*, 
define 

CO 

Countdown(ti) = Lifespan(tijV^) 



to be the amount of time elapsed between the birth of v 
and the time at which all of the infinitely many north- 
west descendants of v have been born. Given that r > 1, 
the sum converges with probability 1, and U < Birth(i;) -|- 
Countdown(u). 

Proposition 14. For all squares w € T,* , if there exists 
a square w G S* such that Birth(w) + Countdown(w) < 
Birth(ii;), then for all integers n > 0, we have w ^ Leaves„. 



For Proposition 14 to be useful, Countdown(w) must be 
small some of the time. The significance of the constant 
ln(15/14) is that for aU w e E'', 

Pr(Lifespan(w) < r""* ln(15/14)) = 1/15. 

Lemma 15. Assume that r > 1. There exists an integer 
k > 1 such that for all integers d > and all sets of squares 
1^ C E'' with \V\ = 4k, it holds with probability at least 
15/16 that there exists a square v €V such that 



□ 



Countdown(u) < r"'' ln(15/14). 
Proof. See the appendix. 
For all integers d > 0, define 

oo 

Borurf = E'^ n Leaves„ 

n=0 



to be the set of all squares at depth d that ever enter the 
tree. In expectation, each set Borud is small. 

Lemma 16. Assuming that r > 1, 

E[#Bornd] - 0(d). 

Proof. Rather than analyze ^Born^ directly, we define 
sequences of sets Candidates^ 3 Born^ and Firsts C 
Candid ates ,;. The set Firsts is chosen so that we can apply 
with V = Firsts; the set CandidateSd+i contains 



Lemma 



15 



those squares that can still enter the tree given the quickest 
countdown in Firsts . We prove that for all d, 

E[#CandidateSrf+i | Td] < 16fc(d+ 1) + #CandidateSd/2, 

where the cr-algebra Fd belongs to a filtration that tracks 
the set of observed lifespans, and k is the constant from 
Lemma 15 It follows that E[#CandidateSd] < 32kd. 



The sequences of sets Candidates^ and Firsts are defined 
inductively. Let 

Candidateso = {A} 

consist of the root square and let J-q be the trivial cr-algebra 
(condition on nothing). The set Firsts, which is determin- 
istic given Fd, consists of the 4fc squares v e Candidates^ 
with the least values of Birth('i;) for which Countdown(ti) is 



7 



independent of J-d- (If there are fewer than 4A; such squares, 
take all of them.) Let 

Parents^ = 

{w : w £ Candidates^, 

Vu e Firstrf, Death(z/;) < Birth(u) + Countdown(z;)} 

be a set of squares whose children can still enter the tree. 
Finally, define 

Candidatesd+i = {wa : w € Parents^, a £ E} 

and let J^d+i be the cr-algebra generated by J^d, the random 
variables Lifespan('i;A''^) for v £ Firsts and j > 0, and the 
random variables Lifespan(w) for w £ E'^. 

By construction and Proposition[T4j it holds that Born^ C 
Candidatesd. Of the squares in Candidates^, at most Akd 
have countdowns that are not independent of J-^, one for 
each square in Firsto U • • • U Firstc(_i. At most 4fc belong to 
Firstd- As for the remainder, with probability at least 15/16 
there exists by Lemma [15] a square v £ First,; such that 
Countdown(w) < r"'' ln(15/14). For all remaining squares 
w, 

Pr{w £ Parents^ | J-d) 

< Pr(Birth(?i;) + Liiespan{w) 

< Birth(w) + Countdown(t;) | J^d) 

< Pr(Lifespan('u;) < Countdown(u) | J^d) 

since Birth(?;) < Birth(ii;) 

< 1/15, 

for a contribution of #CandidateSd/4 to 
E[#CandidateSd+i | J^d]- The event where v does not 
exist contributes the same amount. □ 

Proof of Theorem^ The lower bound MaxFlow„ = fi(l) is 
trivial. As for the upper bound, 

MaxFlow„ = 1 + 2-l"'l 
by Lemma [8] 

oo 

oo 

since Leaves^ ^ [J Borriff 

oo 

E[MaxFlow„] = 1 + E '^^'^0{d) = 0(1) 

ti=0 

□ by Lemma [TBI 
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□ 



least 3/4 that 

Vi; e S", Death(u) < r-^ 
that is, that every birth at depth a has occurred. 
Proof of Theorem continued. 

E[MaxFlow„] 

> 2'^ - t-"^ ElFnit)] 

n— 1 - 

>2''-.-exp((4.-l)5: 

fe=0 

by Lemma [T3| 
>2''-t-'^exp((4t-l) ^"^^"^+^^ 

by integrating, since 4i — 1 < 
= 2''(l - (3c/2)-''exp((c - 1/3) ln(3n + 1))) 

>2'^(l- exp([-cln(3c/2) + c - 1/3] \n{3n + 1) 

since 3c/2 < 1 and d < cln(37i -|- 1) 
-2'^(l-exp(-l)) 



Proof of Lemma | j5| Recalling that A is the empty string, 
let 

u = ln(15/14) 

p = Pr(Countdown(A) < u). 
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For all squares w G E'', we have Pr(Countdown(M;) < 
r"'' ln(15/14)) = p. After showing that p > 0, wc finish by 
setting /c > 1 to the least integer such that (1 — p)^*^ < 1/16. 
Define fc > to be the least integer such that 

oo 

< u/4. 

By Markov's inequality, 

Pr(^Lifespan(iV^) < > 1/2. 

j=k 

For all integers j G [0, k — 1], it holds with positive proba- 
bility that Lifespan(Af^) < u/{2k), and all of the terms are 
independent. □ 
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